If no information gain implies no disturbance, then any discrete physical theory is 

classical 
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It has been suggested that nature could be discrete in the sense that the underlying state space 
of a physical system has only a finite number of pure states. For example, the Bloch ball of a single 
qubit could be discretized into small patches and only appear round to us due to experimental 
limitations. Here, we present a strong physical argument for the quantum theoretical property that 
every state space (even the smallest possible one, the qubit) has infinitely many pure states. We 
propose a simple physical postulate which dictates that in fact the only possible discrete theory is 
classical mechanics. More specifically, we postulate that no information gain implies no disturbance 
— or, read in the contrapositive, that disturbance leads to some form of information gain. In a 
theory like quantum mechanics where we already know that the converse holds, i.e. information gain 
does imply disturbance, this can be understood as postulating an equivalence between disturbance 
and information gain. What's more, we show that non-classical discrete theories are still ruled out 
even if we relax the postulate to hold only approximately in the sense that no information gain only 
causes a small amount of disturbance. Finally, our postulate also rules out popular generalizations 
such as the PR-box that allows non-local correlations beyond the limits of quantum theory. 



In contrast to classical theory, quantum theory has the 
remarkable property that the state space of every sys- 
tem has continuously many pure states. These are states 
which can be seen as states of maximal knowledge: They 
cannot be prepared by flipping a (possibly biased) coin 
to decide between two different preparation procedures 
to be executed, hiding the outcome of the coin flip. Even 
the qubit, the smallest possible system with no more 
than two perfectly distinguishable states, has continu- 
ously many such states. This non-discreteness of quan- 
tum theory contrasts sharply with classical theory, where 
systems with a finite number of perfectly distinguishable 
states have the same finite number of pure states. While 
from a mathematical point of view, this quantum prop- 
erty is satisfactorily explained as a consequence of the 
mathematical framework of quantum theory, a physical 
explanation of this phenomenon is less evident. 

One might conjecture that the actual state space of a 
physical system is discrete with only finitely many pure 
states (see Fig. 1) [1, 2]. The fact that experiments have 
not found a deviation from the continuous nature of the 
quantum state spaces could then be explained by insuffi- 
cient measurement precision. A qubit, for example, could 
be described by a polytope that approximates the contin- 
uous spherical shape of the Bloch ball very well, while it 
actually is a discrete system. Quantum gravitational con- 
siderations have led some authors to the idea that indica- 
tions for the discreteness of spacctimc could in turn pro- 
vide an indication for the discreteness of quantum state 
spaces [1, 2]. Such considerations might suggest state 
spaces with an extremely high number of pure states, 
but as long as the number of pure states is finite, they 
would differ from quantum state spaces in a fundamental 
way. 

In this work, we present a strong physical counter- 
argument to the idea that quantum theory could be re- 
placed by a theory with discrete state spaces. This ar- 



FIG. 1. One might conjecture that physical state spaces are 
discrete in the sense that they only have a finite number of 
pure states. In such discrete state spaces, the pure states are 
given by the corners of the state space. 



gumcnt is derived from a postulate which claims a very 
basic principle for measurements. It states that if a phys- 
ical system is in a state such that a measurement has a 
definite outcome (i.e. an outcome with probability one), 
then performing this measurement does not change the 
state of the system. We regard this principle to be a 
natural property of a theory that describes physical mea- 
surements, so wc impose it as a postulate. Performing a 
measurement with a definite outcome does not give any 
information, while performing a measurement for which 
the outcome is not known in advance can be seen as a 
process of gaining information. This allows to regard our 
postulate as a converse to the well-known fact in quantum 
theory that information gain causes disturbance [3]: We 
postulate that a measurement with no information gain 
causes no disturbance. We prove that a non-classical 
probabilistic theory which satisfies this postulate cannot 
be discrete. By a discrete system, we mean a system for 
which the state space has only finitely many pure states. 
In other words, we show that every theory which satis- 
fies our postulate must either be classical or it must have 
infinitely many pure states. 
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I. TECHNICAL INTRODUCTION 
The framework 

We formulate our result in the abstract state space 
framework [4-7]. This framework arises from the idea 
to consider the largest possible class of physical theories 
(more precisely, generalized probabilistic theories) which 
satisfy minimal assumptions, containing classical and 
quantum theory as special cases. This allows us to study 
properties of quantum theory, like the non-discreteness 
of the state space, from an outside perspective. Here we 
discuss these minimal assumptions very briefly and refer 
to [8] for a detailed introduction to the abstract state 
space framework and its mathematical background. 

The framework, which relies on four minimal assump- 
tions, is based on the idea that any physical theory ad- 
mits the notions of states and measurements. Their in- 
terpretation is assumed to be given. The first assumption 
is that the normalized states form a convex subset flj^ of 
a real vector space A. The underlying motivation is the 
idea of probabilistic state preparation: If a;,T G il.A arc 
states which can each be prepared by a corresponding 
preparation procedure, then executing the preparation 
procedures with probability p and 1 — p should also lead 
to a state (described by the convex sum puj + {l—p)T) and 
should therefore be an element of ^Ia as well. The sec- 
ond assumption is that the dimension of the vector space 
containing the set of states is arbitrarily large but finite. 
This is a purely technical assumption intended to make 
the involved mathematics feasible. The third assumption 
is that the set of states fi^ is compact. Although there 
might be some physical motivation for this assumption, 
we shall be satisfied with considering it as a technical 
assumption. 

Before we discuss the fourth assumption, we make a 
few comments on the structure of flA- The extreme 
points of flA are the pure states of the system, the other 
elements are mixed states. Since Qa is a convex and 
compact subset of a finite-dimensional vector space A, 
every element of ^Ia is a convex combination of the ex- 
treme points of riA [9]. Thus, every state is a convex 
combination of pure states. Since a convex combination 
is a sum with positive weights that sum up to one, a 
state can be seen as a probability distribution over pure 
states. In general, this probability distribution is not 
unique. In classical theory, however, it is (see the ex- 
ample below). In addition to the normalized states Qa, 
an abstract state space A also contains the subnormal- 
ized states , which are given by all rescalings of the 
normalized states by factors between zero and one. 

The fourth assumption states, roughly speaking, that 
every mathematically well-defined measurement is re- 
garded as a valid measurement: A measurement is a 
finite set Ai = {/i,..-,/n} of functions fi : A M. 
which are called effects, each corresponding to an out- 
come of the measurement. For a state w £ ^Ia, the value 
fiij-o) is interpreted to be the probability that the mea- 



surement yields the outcome i when the system was in 
the state a; prior to the measurement. Thus, one must 
have < fi{Ld) < 1 for all cj E Ha- If the measured 
system was in the state a; with probability p and in 
the state r with probability 1 — p, then the probabil- 
ity pfi{uj) + (1 — p)fi{T) of getting the outcome i has to 
be identical to fi{puj+ (1 —p)t) since pLd+ (1 —p)t is re- 
garded to be a state in its own right (in accordance to the 
first assumption). Skipping a few details, this means that 
effects are assumed to be linear. Moreover, the effects of 
a measurement have to sum up to the so-called unit effect 
fi ~ '^A for which ua{(jj) = 1 for all lo G ^Ia (since 
the probability that any outcome occurs has to be one). 
The fourth assumption is that every set of such linear 
functionals (effects) is a valid measurement. We denote 
the set of all effects on an abstract state space by Ea, 
and we denote a measurement (i.e. a set of effects that 
sum up to the unit effect) by calligraphic letters {M or 
Af in this paper) . These four assumptions determine the 
framework of abstract state spaces. 



Examples 

In the following, we introduce a few examples of theo- 
ries which can be formulated in the abstract state space 
framework. While quantum and classical theory are theo- 
ries of actual physical significance, other theories that we 
introduce play the role of toy theories which are helpful 
to understand the framework. Especially the square and 
the pentagon, which are instances of polygon models (see 
below), will serve as useful examples in the illustration 
of the proof idea of our result. 

Quantum theory: The set of states of a (finite- 
dimensional) quantum system is given by Qa = SiJi) for 
some (finite-dimensional) Hilbert space H, where S{7i) 
denotes the positive operators on H with unit trace (the 
density operators) . These operators form a compact con- 
vex subset oi A = Herm('H), the vector space of Hcrmi- 
tian operators on Ji. Every quantum system has contin- 
uously many pure states. The most general description 
of measurement statistics in quantum theory is given by 
a POVM, which is a set {Fi}^^^ of positive operators 
which sum up to the identity operator I on T-L. They 
give rise to the effects p H' tr{Fip) which sum up to the 
unit effect ua given by p i-> tr(/p) = I for all p £ S{H). 

Classical theory: The states flA of a (finite) classical 
theory are given by a simplex, that is by the convex hull of 
finitely many affinely independent points. (We say that 
points pi, . . . ,p„ in a real vector space are affinely inde- 
pendent if no point is an affine combination of the other 
points, i.e. if for every pi, there are no real coefficients 
{ak}k=ii with J2k^i'^k = 1 such that Y^k^i^kPk = Pi-) 
Examples of simplices are given by a line segment, a tri- 
angle, a tetrahedron, a pentachoron and so on. Every 
element of a simplex Q,a is a unique convex combination 
of the extreme points of ^a (see Fig. 2). Thus, for a sim- 
plex Q.A, the states are in a one-to-one correspondence 
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with the probabihty distributions over the pure states, 
which in the case of a simplex are perfectly distinguish- 
able. This allows to interpret the pure states as classical 
symbols. In a classical system, there is a generic mea- 
surement. For a given state w, the outcome probabilities 
for this measurement are precisely the coefficients in the 
convex sum of the pure states which yield w. 
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FIG. 2. A classical system is described by a simplex, which 
has the property that every point is a unique convex com- 
bination of the extreme points. Thus, a state in a classical 
system corresponds to a unique probability distribution over 
classical symbols. 

Discrete theories: We call Q,a a discrete state space 
if it is the convex hull of finitely many (not necessarily 
afHnely independent) points. Since VIa is compact, this 
is equivalent to saying that the theory has only finitely 
many pure states. Classical theory is an example of a 
discrete theory, while quantum theory is not. 

Box world: The generalized non- signalling theory [10], 
colloquially called box world, is a whole class of state 
spaces which can be formulated as abstract state spaces. 
It includes the well-known PR-box [11] and its local re- 
duced state space, the so-called gbit. All of them have 
only finitely many pure states, so they are all discrete the- 
ories according to our definition. The states VIa of a gbit 
form a square. Since the whole situation can be drawn 
in only three dimensions, the gbit provides an example 
for which wc can give a picture (see Fig. 3). To see the 
interplay of states and effects in such a low-dimcnsional 
example, it is useful to represent effects as vectors in the 
same space as the states [12]. To evaluate an effect at 
some state, one simply takes the scalar product of the 
state and the vector representing the effect. This geo- 
metric representation will be useful in the illustration of 
the proof idea in the Methods section. 

The polygon models [12]: These are abstract state 
spaces where Ha is a regular polygon, so they are special 
kinds of discrete theories. As their states and effects can 
be easily drawn in three dimensions, they also provide 
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FIG. 3. The gbit as an abstract state space. The upper part 
of the figure shows the set of normalized states Qa (gray), 
together with the subnormalized states (white "pyra- 

mid"), which are given by all rescalings of normalized states 
with factors between zero and one. In the lower part of the 
figure, the subnormalized states are omitted. Instead, the ef- 
fects Ea are shown (here they correspond to an octahedron). 
The reader who is familiar with the mathematics of ordered 
vector spaces may notice that the effects arise from the struc- 
ture of the dual cone ^+ (more precisely, the efi'ects form an 
order interval [0, it^] in A*) [8]. Here, they are represented 
as vectors in the same space as the states. To calculate a 
probability /(cj), one simply takes the scalar product of the 
vector Lj and the vector representing /. 



examples that we can depict. The square polygon cor- 
responds to the gbit. In the Methods section below, the 
square and the pentagon will be the central examples in 
the illustration of the proof idea. 

Strictly convex theories: These are theories where the 
set of normalized states Ha is a strictly convex set. A 
strictly convex set is a set where all faces are single points 
(if the notion of a face of a convex set is unknown, see 
[8] or the Supplementary Information). In other words, a 
(compact) strictly convex set is a set such that its bound- 
ary contains no line segment, so the set is "round" at ev- 
ery point of the boundary. For example, the qubit, which 
is represented by the Bloch ball, is strictly convex, but 
every higher-dimensional quantum system is not. The 
latter follows from the fact that if H' is a subspace of a 
Hilbert space H, then S{H') is a face of S((H). 



Post-measurement states 

In the preceding two subsections, we have discussed the 
core structure of abstract state spaces: states and effects. 
They only allow for the description of one-shot measure- 
ment statistics. If one wants to describe the statistics of 
several consecutive measurements, then one has to spec- 
ify what happens to the state of the system when a mea- 
surement is performed (otherwise, the statistics of the 
subsequent measurement cannot be described). In other 
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words, one has to specify a rule for post-measurement 
states. The structure of an abstract state space, how- 
ever, does not provide such a rule. The question of how 
one could specify a rule for post-measurement states in 
abstract state spaces has gained no attention in the lit- 
erature so far. 

We deal with this question and consider some ex- 
tra structure on abstract state spaces which provides 
a rule for post-measurement states. Wc describe the 
transition from the initial state of the system (prior 
to the measurement) to the post-measurement state by 
what we call a measurement-transformation. Our result 
makes a statement about the existence of measurement- 
transformations in abstract state spaces which satisfy a 
certain postulate. 

As wc have just mentioned above, the general idea is 
that a measurement-transformation specifies a rule for 
how post-measurement states are assigned. However, in 
a physical theory, how such a rule looks like depends on 
the particular situation which one wants to describe. To 
be more specific, we can think of at least three such sit- 
uations (we will make quantum examples below), which 
correspond to the case where 

(a) the observer finds out the outcome of the measure- 
ment and describes the state of the system after the 
measurement conditioned on that outcome. 

(b) the observer describes the system after the measure- 
ment by a subnormalized state for the hypothetical 
case that a particular outcome occurred, incorporat- 
ing the probability of that outcome into the post- 
measurement state. 

(c) the observer does not find out the outcome of the 
measurement and describes the state of the system 
after the measurement, knowing only that the mea- 
surement has been performed. 

A physical theory has to allow for a mathematical de- 
scription for all of these cases. Each of the three situa- 
tions can be described by a particular kind of map. To 
understand the difference between them, it is helpful to 
see how these maps look like for the particular case of 
quantum theory. There, if the measurement is a pro- 
jective measurement M = {Pi}f^i, the maps are given 
by Luders projections [13] (the literature is ambiguous 
about which of the three maps is called a Liiders pro- 
jection, but as they are very closely related, this usually 
does not lead to problems). The situations (a), (b) and 
(c) above are described by the following maps: 

(a) If the outcome associated with projector Pk is mea- 
sured, then the state is transformed as 

^ PkpPk 

P ^ — ; r • 

tr(Pfep) 

(b) Considering the outcome associated with projector 
Pfc, the state transforms into a subnormalized state 



as 

P ^ PkpPk ■ 

(c) If the outcome of the measurement is unknown, the 
state is transformed as 

i=i ^ ^' ' i=\ 

Most introductory textbooks on quantum theory only 
discuss situation (a). Note that (a) is not a linear map. 
By the definition that we will make below, it should not 
be called a transformation. The maps (b) and (c) are 
linear. The map (b) describes what Liiders calls a "mea- 
surement followed by selection" , whereas the map (c) de- 
scribes what he calls a "measurement followed by aggre- 
gation" [13]. 

The preceding discussion allows us to understand 
what we mean by a measurement-transformation. By 
a measurement-transformation, we mean a map of type 
(b). Note that such a map leads to subnormalized post- 
measurement states rather than normalized ones. The 
norm of the post-measurement state (the trace-norm in 
the quantum case) is equal to the probability that the 
outcome occurs (which is what we mean by "the proba- 
bility of that outcome is incorporated into the state" ) . 

Choosing maps of type (b) (rather than maps of type 
(a) or (c)) as the subject matter is not a relevant restric- 
tion since the three types of maps are so closely related 
that insights into one of these maps translate into insights 
into the other maps as well. In particular, from the map 
of type (b), one can construct the map of type (a) by 
rcscaling the images with the inverse probability and the 
map of type (c) by summing up over all outcomes. 

With the above motivation in mind, we now proceed 
to the task of formally defining what we mean by a 
measurement-transformation on an abstract state space. 
A transformation T on an abstract state space A is a 
linear map T : A ^ A such that r(f^^) C rj^\ The 
motivation for the linearity of transformations is simi- 
lar to the motivation for the linearity of effects. The 
linearity expresses a compatibility condition for proba- 
bilistically prepared states: If the system is in a state 
Lj with probability p and in a state r with probabil- 
ity 1 — p before the transformation, then the trans- 
formed state pT{uj) -f (1 — p)T{t) has to coincide with 
T{pu! -f- (1 — p)t) since puj -I- (1 — p)t is regarded as a 
state in its own right. (A more rigorous argument would 
require pf{T{u)) + (1 -p)/(r(r)) = f{T{pLo + (1 -p)t)) 
for all effects /, which eventually boils down to what 
we have just required.) A measfiremeni-transformation 
has to satisfy one more condition. As we have explained 
above, a measurement-transformation is associated with 
a particular outcome, or more precisely, with a particu- 
lar effect. If T is a measurement-transformation for an 
effect /, then we require that the norm ua{T{lj)) of the 
transformed state is equal to the probability f{uj) for 
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measuring the outcome associated with /. In short, we 
require 

UAoT = f . 

In quantum theory, where ua is given by the trace, this 
property is satisfied for projective measurements since 
the Liiders projection gives tr(P/9P) = ti{P^p) = tT{Pp). 

We will only consider measurement-transformations 
for a special class of effects which we call pure effects. 
We say that an effect / e Ea is pure if it is an ex- 
treme point of the (convex) set of effects Ea , and we say 
that a measurement A4 = {/i, . . . , /„} is pure if every 
effect /i , . . . , /„ is pure. It turns out that in the case 
of quantum theory, an effect F i-> tr(Fp) of a POVM 
element F is pure if and only if is a projector [8]. 
Thus, we only consider measurement-transformations for 
a class of effects which, in the case of quantum theory, 
reduces to projectors. For this class, the measurement- 
transformations are given by Liiders projections. The 
fact that we will restrict our considerations to pure effects 
is not a restriction of the validity of our result. Quite the 
contrary, this makes our result stronger. As we will see 
below, our postulate claims a property of measurement- 
transformations for pure effects rather than claiming this 
property for all effects. This results in a weaker pos- 
tulate, so every implication derived from this postulate 
leads to a stronger result. As we will see later, we will 
restrict the claim of the postulate to an even smaller sub- 
class of effects (see the Methods section and the Supple- 
mentary Information for further details). 

In a nutshell, a measurement-transformation for a pure 
effect / is a linear map T : A ^ A with T{nA) C 
and Ua o T = f. 

II. MAIN RESULT 

Let us first state our postulate. For a mathematically 
precise formulation, we refer to the Methods section and 
the Supplementary Information of this article. 

Postulate (No information gain implies no disturbance): 
If a physical system is in a state uj such that some pure 
measurement has a certain outcome (i.e. an outcome 
with probability one) , then there exists a way to perform 
this measurement without changing the state lo. 

In more technical terms, the postulate states that for 
every pure effect / <S Ea, there exists an associated 
measurement-transformation T with ua o T — f such 
that for all states tu E Ha with /(w) = I, we have 
that T{uj) = UJ. The existence of such a measurement- 
transformation T is what is meant by saying that there 
exists a way to perform the measurement. Furthermore, 
note that without looking at the definition of a measure- 
ment transformation, saying that "there exists a way to 
perform the measurement" may appear trivial by itself. 
After all, doing nothing and outputting the measurement 



outcome (associated with) / preserves w and yields / 
with probability 1. This case is ruled out by the defi- 
nition of a measurement-transformation. More precisely, 
note that T must be such that {uaoT){u!') = f{uj') for all 
states w' £ That is, it yields the correct probabilities 
for any state that we wish to measure. 

It is interesting to note that the actual proof of our 
main result only needs an even weaker, but rather tech- 
nical requirement (see the Methods section). To see the 
link to information gain, note that the Shannon infor- 
mation content (see e.g. [14]) — log/(a;) is zero for any 
outcome of an experiment that occurs with certainty. As 
such, f{uj) = 1 is equivalent to stating that no informa- 
tion gain occurs. The demand that T(cj) = a; says that 
the state is unchanged, i.e., no disturbance has occurred. 

Quantum theory and classical theory satisfy this pos- 
tulate. In quantum theory, for example, if a system is 
in a state p such that a projective measurement {Pi}f^i 
has some outcome k with probability tr(Pfcp) = 1, then 
the transformation p i~>- PkpPk leaves the state invariant. 
Quantum theory even satisfies the postulate in a much 
stronger form in the sense that little information gain also 
causes only little disturbance. This can be seen from a 
special case of the gentle measurement lemma [15, 16]. 
It states that if measuring an outcome associated with a 
projector F has probability ir^Fp) > 1 — e, then mea- 
suring that outcome disturbes the state by no more than 
Hp — FpF\\i < VSe- Setting e = 0, this reduces to our 
postulate. However, we emphasize that our postulate is 
much weaker than postulating the gentle measurement 
lemma. We also note that our postulate does not make 
any assumptions about locality, i.e., it does not make 
a statement about whether verification measurements of 
bipartite states can be implemented on local quantum 
systems or locally disturb the state as has been consid- 
ered in [17]. 

Even though the statement of the postulate is very 
concise, it may appear unsatisfying since it involves the 
abstract concept of a state, which is something that one 
cannot observe directly. However, it can be reformulated 
in purely operational terms, referring only to directly ob- 
servable objects, namely measurement statistics. Such a 
reformulation is possible because two states can be re- 
garded as being identical if and only if they induce the 
same measurement statistics for every measurement (in 
more mathematical terms, a state w is an equivalence 
class under the relation a; ^ w' <^ ifi^) = /(w') for 
all / g Ea)) [18]. Hence, instead of making statements 
about states, one can make statements about the statis- 
tics of all potential measurements. Figure 4 illustrates 
the idea of this reformulation. 

In terms of the postulate, our result can now be stated 
as follows. 

Result: An abstract state space which satisfies the pos- 
tulate is either non-discrete (i.e. it has infinitely many 
pure states) or it is classical. 
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Consider a preparation V which outputs an initial state 
uj G Q,A and a measurement M — {/i, ...,/„} such that 
fk{y>) ~ 1 for some k. According to the postulate, the 
state of the system after the following two experiments 
are identical: 



M 



outcome k 
with certainty 



identical states, 
Tk{uj) = UJ 



Thus, if the two experiments are followed by any mea- 
surement, say M = {g\, . . . , gi\, then the statistics of the 
A/'-measurement coincide: 



M 



outcome k 
with certainty 



I 

1 



identical 
statistics 
for every 
measure- 
ment N 



The A/'-statistics coincide for every measurement N ■ 
This is equivalent to saying that the states prior to the 
A/'-measurement (i.e. T^iuj) and a;) are identical. Thus, 
we do not need to refer to states and can reformulate the 
postulate as: // a measurement has a definite outcome, 
then performing thts measurement does not influence the 
statistics of any subsequent measurement. Diagrammati- 
cally, 




identical statistics for every M 



FIG. 4. A reformulation of the postulate in purely op- 
erational terms: Instead of referring to initial and post- 
measurement states, the reformulated version states that a 
measurement with a definite outcome does not influence the 
statistics of any subsequent measurement, so it only refers to 
directly observable quantities. 



This means that if a physical system is described by 
an abstract state space where the set of states VI a is a 
polytope which is not a simplex (i.e. if it is a discrete 
non-classical system), then it violates our postulate. 

What is more, our result is robust in the sense that 
discrete non-classical theories arc ruled out even if the 
postulate is weakened to an approximate version. To for- 
mulate this approximate version of the result, we assume 
that A is equipped with a norm || • H^. This induces a dis- 
tance function dist(a;,a;') :— ||a; — lo'Wj^ on A. We prove 
that for every discrete non-classical theory, equipped 



with some norm || • \\a, there is a positive number e > 
such that the implication /(w) = 1 ^ — '^lU — ^ 

(where T is the measurement-transformation for /) can- 
not be satisfied for every pure effect / £ Ea- We prove 
this approximate case, which is a stronger version of the 
result, in Section B of the Supplementary Information. 



III. DISCUSSION 

Our simple postulate rules out discrete non-classical 
theories, while classical and quantum theory satisfy the 
postulate. Read in the contrapositive, our postulate says 
that disturbance implies information gain. Any theory 
that does not satisfy our postulate thus allows for dis- 
turbance without a corresponding ability of information 
gain. Note that even in a theory which a priori only de- 
fines transformations T , one can define effects as ua ° T . 

We also note that our postulate rules out several al- 
ternatives to quantum theory, most notably the famous 
PR-box [11, 19, 20] that allows a violation of the CHSH 
inequality [21] far beyond the limits of quantum theory. 
More specifically, the PR-box allows a violation that is 
only limited by imposing the rule that no information 
can travel faster than light. This is in spirit similar to 
other approaches such as information-causality [22] , com- 
munication complexity assumptions [23] , the assumption 
of local quantum mechanics [24] or the uncertainty prin- 
ciple [25]. We emphasize, however, that whereas this is 
a nice byproduct of our result, our real aim lies in the 
study of local physical systems with the goal to identify 
just one postulate that sheds light on the simple question 
whether the state space should be discrete or continuous. 
It is very satisfying that this question can be understood 
by introducing just a single postulate. 

One may wonder whether our postulate does in fact 
rule out all theories but classical and quantum mechan- 
ics. To answer this question, let us first be more pre- 
cise about what we mean by "a theory is (not) ruled out 
by the postulate". We mentioned in the preceding sec- 
tion that for general abstract state spaces, measurement- 
transformations are not specified, so we cannot make 
statements saying that the measurement-transformations 
do (not) satisfy our postulate. Instead, we can discuss the 
following well-defined question: Given an abstract state 
space, is it true that for every pure effect, there exists 
a measurement-transformation which satisfies our postu- 
late? If this is the case, then we say that the theory can 
satisfy the postulate, or that it is not ruled out by the 
postulate. If this is not true, then we say that the theory 
cannot satisfy the postulate, or that it is ruled out by the 
postulate. 

This is the precise meaning of our statement that "dis- 
crete non-classical theories are ruled out by the postu- 
late". Using this terminology, we can identify a class of 
theories which, in addition to classical and quantum the- 
ory, is not ruled out by the postulate: the strictly convex 
theories can satisfy our postulate. There are more thco- 
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ries which can satisfy the postulate, but we do not know 
a concise classification. For example, a state space V.a 
formed like a piece of pizza is ruled out by the postulate, 
while a state space formed like an ice cream cone is not. 
Figure 5 gives an overview. 




postulate can 
be satisfied 



some theories can satisfy the postulate, 
some theories cannot, for example: 

A state space with a "piece of 
pizza" form is ruled out, 

^ -X but a state space with an "ice 



V 



cream cone" form can satisfy 
the postulate. 



FIG. 5. An overview over the abstract state spaces ruled out 
by the postulate. 

In the recent past, there have been several attempts 
to derive (finite-dimensional) quantum theory within a 
framework of probabilistic theories [26-28]. The idea is 
the following. One starts with a very general framework 
of probabilistic theories (like the abstract state space for- 
malism). Then, one imposes a few physical postulates 
(our postulate can be seen as one such postulate). If one 
manages to show that all theories in this framework other 
than quantum theory are ruled out by these physical pos- 
tulates, then this can be seen as a physical derivation of 
quantum theory. As our postulate rules out quite a large 
fraction of all possible abstract state spaces already (see 
Fig. 5), it seems promising that adding just a few more 
postulates might be sufficient to rule out all theories ex- 
cept for quantum theory. 

However, we do not make such an attempt and focus 
on one particular aspect only, introducing only one pos- 
tulate. What makes our postulate special is that its na- 
ture is very different from the postulates that have been 
considered in this context so far. Many approaches fo- 
cus on the aspect of non-locality, introducing rules for 
how physical systems are combined to form bi- or multi- 
partite systems. In contrast, our approach deals with 
local state spaces only, making a statement about post- 
measurement states. Within probabilistic theories, this 
aspect has gained no attention in the literature so far. 
The fact that we introduce just one postulate (instead of 



a set of postulates) helps us to understand its influence 
on one particular aspect of of physical theories. 

One might argue that an experimental proof of the 
non-discreteness of physical state spaces needs infinite 
measurement precision since the verification of the postu- 
late that T{io) = ijj (strict equality) requires the verifica- 
tion that ijj and T{yi) give rise to the same measurement 
statistics (to arbitrary precision). Hence, our result is 
experimentally less accessible than other no-go theorems 
(e.g. the Bell Inequality, where it is sufficient to verify the 
violation of a single statistical inequality). There is a par- 
tial reply to this objection. As we have mentioned before, 
there is an approximate version of our result. It states 
that for a given polytope P, there is a positive number 
ep > such that the postulate can be weakened to the 
following form (without changing the validity of the re- 
sult): If a measurement on a state has an outcome with 
probability one, then performing the measurement docs 
not change the state of the system by more than tp (for 
details, see Part B of the Supplementary Information). 
Thus, even if one weakens the postulate to allow for an 
ep-disturbance of the state, it still rules out the poly- 
tope P. This is a stronger form of the result. It states 
that in order to rule out a given polytope experimentally, 
only finite measurement precision is needed (quantified 
by ep). However, the allowed disturbance ep depends 
on the polytope P, so in order to rule out all polytopes 
experimentally, infinite measurement precision is needed 
because for every measurement error, there could be a 
polytopic theory for the measured system for which the 
allowed disturbance ep is too small to be tested. 



IV. METHODS 

In this section, we sketch the idea of the proof of our 
main result. For the full version of the proof and for a 
proof of the approximate version of our result, see the 
Supplementary Information of this article. Here, we aim 
for a geometric understanding of the proof. It is mainly 
based on a lemma which establishes geometric criteria for 
a set of states Qa which is compatible with our postu- 
late. To illustrate this lemma, we provide two very basic 
examples which violate these criteria: the square and the 
pentagon (see Fig. 7). For these two examples, it is easy 
to see geometrically why they cannot satisfy our postu- 
late (as we will illustrate in Fig. 8). Finally, we describe 
roughly how we prove that every polytope Ha which sat- 
isfies the conditions of the lemma is a simplex (which is 
our main result). 

Before we sketch the proof of the main result, it is 
useful to define in a bit more detail what an abstract 
state space is. For detailed definitions of the framework, 
see the Supplementary Information of this article, for a 
detailed motivation of the framework with detailed ex- 
amples, see Chapter 3 in [8]. 

As illustrated in Fig. 6, an abstract state space is fully 
specified by a tuple (A, Aj^., ua), where A is a real finite- 



8 



dimensional vector space, is a cone in A and ua is a 
linear functional on A (called the unit effect). This linear 
functional is required to be strictly positive on the cone 
A+ (i.e. ua{uj) > for all lo G A+ \ {0}). The tuple 
(A, A^,ua) gives rise to the normalized states ^Ia and 
the subnormalized states ft^^ in the following way (c.f. 
Fig. 6): 

riA ■■= {uj€A+\ ua{lu) = 1}, 
{LoeA+\ ua{lo) < 1}. 



A+ 




FIG. 6. The states of any normalization are given by a cone 
A+ in the real vector space A. The linear functional ua gives 
the normalization of a state, so the intersection of A+ with 
the plane described by UA{ijj) = 1 gives the normalized states, 
while the subnormalized states are those elements of A+ 
where ua takes values between and 1. 

The set Ea of effects on A is given by the linear func- 
tionals which take values between zero and one on the 
states i^A, i-e. 

EA:={f eA* \ 0< f{iu) < 1 -iu^^lA], 

where A* is the dual space of A. A measurement is given 
by a finite set of effects M — {/i , . • . , /«} C Ea such that 
the effects sum up to the unit effect ua, i.e. Yll=i fi ^ 
Ua- Recall that if the system is in the state uj e Qa prior 
to the measurement described by = {/i, . . . , /„}, then 
the probability for outcome k is given by /fc(w). 

As we have mentioned earlier, we restrict ourselves to 
pure effects when we deal with post-measurement states 
(i.e. with measurement-transformations). The pure ef- 
fects are the extreme points of Ea ■ A pure effect / S Ea 
has the property that the set of states w which have prob- 
ability /(w) = 1 is a face of ^Ia [8]. A face of ft a is 
a convex subset F C flA with the property that every 
line segment whose endpoints are contained in F must 
be fully contained in F, that is a face is some kind of 
"extreme subset". For a pure effect /, this allows us to 
define the certain face Ff of / by 

Ff:^{u;enA\f{uj)^l}. 

Analogously, the set of states ut which have probability 
f{uj) = is a face of ^Ia as well [8]. We call it the 
impossible face of / and define it by 

Ff {ujenAl /(w) = 0} . (1) 



The notion of the certain face and the impossible face of 
an effect is central in our proof. 

A transformation on an abstract state space is a lin- 
ear map T : A ^ A which is positive (i.e. T(A+) C 
A^) and does not increase the norm of the states, i.e. 
UAiT{uj)) < uaIi-^) for all uj G Equivalently, a trans- 
formation is a linear map T :A^ A with T(^2^) C Uj. 
Recall that we describe the state change due to a mea- 
surement by introducing measurement-transformations. 
If a measurement yields an outcome associated to a pure 
effect / G Ea, then the transformation of the state is 
described by w i— >■ T{ui), where T is the measurement- 
transformation for /. As mentioned, we require that T 
is a transformation which satisfies ua oT = f . 

With these definitions at hand, we can formulate our 
postulate as follows: 

Postulate: For every pure effect / G Ea, there is a 
transformation T : A ^ A such that / = ua ° T and 

T{uj) = u) for every uj € Ff. 

Note that we only postulate the existence of a 
measurement-transformation for / that satisfies our pos- 
tulate. For the actual proof, we will require an even 
weaker condition. We will not require the existence of 
such a measurement-transformation for every pure effect 
but only for pure effects for which the certain face Ff is 
what we call a minus-face of ^Ia. This is a face which is 
exactly one dimension smaller than flA. This weakening 
of the postulate is particularly useful for the proof of the 
approximate version of our result. 

To derive the result, we first prove a lemma which es- 
tablishes geometric criteria which a set of states flA has 
to satisfy to be compatible with our postulate. Given a 
pure effect / £ Ea, the lemma tells us geometric criteria 
for the certain face Ff and the impossible face Ff oi f 
which arc necessary for the existence of a measurement- 
transformation satisfying our postulate. It reads as fol- 
lows: 

Lemma: Let {A, A+,ua) be an abstract state space, let 
/ G Ea be a pure effect. If there exists a transformation 
T : A ^ A such that ua°T = f and T{lu) = lo for every 
U! E Ff, then 

(a) dim Ff + dim Ff< dim 57^ — 1 and 

(b) aS{Ff U Ff) n = conv(F/ U Ff), 

where aff(-) and conv( • ) denote the afRne hull and 
the convex hull, respectively (the reader unfamiliar with 
these two notions is referred to the Supplementary Infor- 
mation of this article). 

To get a geometric idea for the two conditions (a) and 
(b), it is useful to consider abstract state spaces which 
violate these conditions. The two simplest examples we 
can think of arc the square and the pentagon, depicted 
in Fig. 7. 




dim + dim = 1, aS{Ff U F f) HQa = ftA, 

dimf^A = 2, so but conv(F/ U Ff) / Qa 

dim Ff + dim F f > dimS^A — 1 

FIG. 7. The square and the pentagon serve as very basic 
examples of abstract state spaces which violate the conditions 
stated in the Lemma. The square violates condition (a) , while 
the pentagon violates (b). 

To see why the conditions (a) and (b) are necessary 
for the existence of a transformation compatible with our 
postulate, we now examine what goes wrong in the case 
where one of the conditions is violated. If condition (a) is 
violated, a contradiction occurs which we call a dimen- 
sion mismatch. If (b) is violated, then we say that a 
shape mismatch occurs. Again, the square and the pen- 
tagon serve as good examples for a geometric illustration. 

Dimension mismatch: If condition (a) is violated (i.e. 
dim Ff + dim Ff > dimO^ ~ l)j then there is no linear 
map T such that 

UAoT = f, (2) 
r(w) = u) for every u € Ff (postulate) . (3) 

In particular, there is no transformation with these two 
properties. To see this, there are two things to notice. 

First, Equation (2) implies that UA{T{io)) = f{uj) = 
for all w e Ff (c.f. the definition (1) of Ff). Since the 
zero-vector w = is the only state (i.e. the only ele- 
ment of f^^^) for which f{uj) ~ 0, it follows that the 
whole impossible face F f has to be mapped to the zero- 
vector. By the linearity of T, this implies that the re- 
striction T^^^^g^^-^ of T to span(F/) is the zero-operator 

on span(_F/): 

^lspan("Fj) ~ 0|spa^,j(-p^,) . (4) 

Secondly, the postulate (3) and the linearity of T imply 
that the restriction T|span(_Fj:) of T to span(F/) is the 
identity operator on span(i^^): 

T|span(Fj.) = ^lspan(Fj) ■ (5) 

However, in the case where dim Ff + dim F f > dim — 
1, equations (4) and (5) lead to a contradiction. In this 
case, the intersection span(F/) n span(F/) is a subspace 
which is at least one-dimensional (see Fig. 8). Equations 
(4) and (5) imply that on this subspace, T has to be the 
zero-operator and the identity-operator simultaneously. 
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Dimension mismatch: 




span(J» nspan(F/) / {0} 



Shape mismatch: 




T{Ff) = 

FIG. 8. Violation of conditions (a) or (b): This figure illus- 
trates geometrically why the square and the pentagon violate 
our postulate. 



which could only be satisfied if the subspace would be 
{0}. 

Shape mismatch: If condition (b) is violated (i.e. 
conv(F/ UFf)^a.S{FfUFf)nflA), then for every lin- 
ear map which satisfies Equations (2) and (3), there is a 
state p such that T{p) ^ fl^^ (i.e. T{p) is not a state). 
Therefore, such a T cannot be a transformation. To see 
this geometrically, it is useful to consider the pentagon 
for a particular choice of the effect / where the certain 
face Ff is an edge of the pentagon (see Fig. 8). Equation 
(2) implies that the impossible face F f is mapped to the 
zero-vector, while Equation (3) means that the certain 
face Ff is left invariant. In the case of the pentagon il- 
lustrated in Fig. 8, there is precisely one linear map T 
with these two properties. It maps the normalized states 
r^A (dark gray surface in the figure) to a set in the vector 
space (dashed lines) which is not contained in $7^^ (the 
truncated cone between and Qa)- In particular, there is 
a p such that T{p) ^ ^Ia- If one compares Fig. 8 with Fig. 
7, then one can see that the part of VIa which is mapped 
to a subset of fl^^ (hght gray face in Fig. 8) is precisely 
given by conv(_Fy U F f) (gray part in Fig. 7). However, 
the part of Qa which is mapped outside of flA is given 
by (aff(F/UF/)nl7A)\conv(F/UF/) (the white part in 
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Fig. 7). This observation generalizes to statement (b) of 
the Lemma: If (a) is satisfied, then T(f2^) is contained 
in if and only if afr(i^/ uF/) n 17^ = conv(F/UF/). 

These two examples illustrate all that can go wrong for 
discrete theories. We show that for every discrete theory 
(i.e. for every theory where VIa is a polytope), either 
condition (a) or (b) is violated (so cither a dimension 
mismatch or a shape mismatch occurs), except for the 
case where is a simplex (i.e. for classical theories). 
To show this, we proceed as follows. 

We consider an abstract state space {A, A+, ua) where 
il.A is a polytope. We assume that for every pure effect 
/ S Ea for which the certain face Ff is a minus-face 
of r^A, there is a measurement-transformation satisfy- 
ing the postulate (3). In a first step, we show (using 
the Lemma) that every polytope Ha which is compatible 
with our postulate has a property that we call being uni- 
formly pyramidal. This means that for every minus-face 
F of ^Ia, it holds that there is a point ap G ^Ia such that 
Ha = conv(F U {ap}) (sec the Supplementary Informa- 
tion for more intuition and figures). In a second step, we 
show that every uniformly pyramidal polytope Ha is a 
simplex. This shows that every discrete theory satisfying 
our postulate has to be classical. 
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SUPPLEMENTARY INFORMATION 



Appendix A: Formal proof of the main theorem 



In this appendix, we prove the main result: A polytopic theory for which our postulate holds is a classical theory. 
A preliminary version of this proof is available at [8] . The proof provided here is much more concise than the proof 
in [8]. It makes no use of strong theorems but is entirely proved on quite an elementary level. Fewer notions are 
introduced, so we only make definitions that are necessary for clarification or that simplify the proof. All in all, the 
proof and its preparation presented here are much shorter. 

The focus of the proof in this appendix is on technical precision. To organize this appendix in a compact way, 
we proceed as follows: In Section Al, we list all the definitions that are necessary to understand the proof. This 
list is given for technical clarification only and cannot be regarded as an introduction to the subject. For a detailed 
introduction to all of the notions and concepts mentioned below, we refer to [8]. In Section A 2, we give a list of facts 
that we state without giving a proof here. These facts are either standard mathematical results, easy to verify or we 
have proved them in [8]. Whenever the latter is the case, we refer to the corresponding proposition. The referenced 
propositions have quite elementary proofs which are not very important for the understanding of the proof of the 
main result. Section A3 is dedicated to the preparation of the main proof. We give a rough outline of what we will 
show and derive a few technical lemmas. Finally, wc prove the main result in Section A 4. 



1. Definitions and Notation 



(Def. 1) A subset C of a real vector space 1^ is a convex subset or convex set ii x,y £ C implies Ax + (1 — X)y G C 
for all < A < 1. 

(Def. 2) A nonempty convex subset of a convex set C is a face if a;, y G C, < A < 1 and Xx + (1 — X)y G F imply 
x,y e F. 

(Def. 3) An element e G C of a convex set is an extreme point of C if the singleton {e} is a face of F, i.e. if x, ?/ G C, 
< A < 1 and Ax + (1 — A) = e imply x = y = e. The set of extreme points of C is denoted by vert(C). 

In the following, V denotes a finite-dimensional real vector space, S CV denotes any subset of V. 

(Def. 4) conv(S') denotes the convex hull of S (see Fig. A 1), given by 



conv(S') % ^ ca^i 



G {0,1,2,...},?;, G S,a^ G [0,1], = 1 



(Def. 5) aff(5') denotes the affine hull of S (see Fig. Al), given by 



aff (5) := \ J2 



G {0,l,2,...},w, G GM,^a, = 1 



i=i 



(Def. 6) span(S') denotes the linear hull (or the linear span) of S. 

(Def. 7) A point p €z V is ajfinely independent of S li p ^ aff(S'). Points pi, . . . ,p„ G V are affinely independent if 
p, ^ aff({pi, . . . ,pn} \ {pt}) for i 1, . . . , n. 

(Def. 8) dim S* := d, where d + 1 is the maximal number of affinely independent points in 5*. (In the case where S 
is a vector subspace of V, this dimension is identical to the vector space dimension of S.) We define the 
dimension of the empty set to be —1. 

The notion of the dimension of a subset of a vector space (Def. 8) allows us to define the following special type of a 
face of a convex set (recall (Def. 2)). 

(Def. 9) We call a face F of a convex set C a minus-face of C if dimF = dimC — 1.^ 



In the context of polytopes, such a face is sometimes called a 
facet of C. However, besides the fact that the the terms "face" 
and "facet" are easily mixed up, the use of the notion of a facet 



in the literature is inconsistent, so we prefer to introduce a new 
name to avoid confusion. 
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FIG. 9. The convex hull of two points p and q is given by the line segment connecting the two points, while the affine hull of 
two points is the whole line through the two points. The point r is affinely independent of p and q since r ^ aff({p, g}). The 
convex hull of three points (which do not lie on a line) is given by the triangle the corners of which coincide with the three 
points, while the affine hull is given by the whole plane containing the three points. 

The following types of convex sets are very central in our analysis. 

(Def. 10) A subset P of a finite-dimensional real vector space is a polytope if P is the convex hull of finitely many 
points. 

(Def. 11) Since vert(P) is a finite set for any polytope P (which is readily verified), we can define the number of 
extreme points of a polytope P by ne{P) := \ vert(P)|. 

(Def. 12) A subset S" of a finite-dimensional real vector space ^ is a simplex if S is the convex hull of affinely independent 
points. More precisely, S* is a d-simplex if it is the complex hull of d+1 affinely independent points. Obviously, 
a simplex is a polytope. 

Before we can give a precise definition of an abstract state space, we introduce some notation and some notions related 
to vector spaces with a cone. 

(Def. 13) For any two subsets M and of a real vector space and for any scalar a e R, we denote 

M + N := {m + n\me M, 72 e N} , 
aM := {am \ m e M} . 

(Def. 14) A nonempty subset K oi a. real vector space y is a cone in V if the following conditions are satisfied: 

K + K = K , 

aK CK Va > , 

Kn{-K) = {0}. 



A cone K in V is called generating if K — K = V . 

(Def. 15) For a vector space V, let V* denote the dual space^ of V. A linear functional f € V* on a real vector space 
with cone K is called strictly positive if f{v) > for all v G K \ {0}. 

In the following, we recall the basic definitions in connection with abstract state spaces. 

(Def. 16) An abstract state space is a tuple {A, A^,ua), where A is a finite-dimensional real vector space, is a 
closed' and generating cone in A and ua is a strictly positive linear functional, called the unit effect. We 
will often denote an abstract state space by A rather than {A, A^, ua)- 



^ We will only consider finite-dimensional vector spaces V, for 
which there is no difference between the algebraic and the topo- 
logical dual space. 



^ The closedness is to be understood with respect to any norm 
on A. Since all norms on finite-dimensional vector spaces are 
equivalent and therefore induce the same topology, the choice of 
the norm is irrelevant. 



14 



(Def. 17) For an abstract state space A, the symbol VIa denotes the set of normalized states, 

nA^{ujeA+\ ua{uj) = 1}. 
The set of subnormalized states is defined by 

= {cj G A+ I ua{<^) < 1} . 



Obviously, Q'^' = conv(f2^ U {0}) (also see Fig. 10). 



A. 




FIG. 10. The states of any normalization are given by a cone A+ in the real vector space A. The linear functional ua gives the 
normalization of a state, so the intersection of A+ with the plane described by ■ua(<j-') ~ 1 gives the normalized states, while 
the subnormalized states are those elements of A+ where ua takes values between and 1. 

(Def. 18) An effect f on an abstract state space ^ is a linear functional f ^ A* such that < /(w) < 1 for all co G ^Ia- 
The set of effects on A is denoted by Ea- An effect is said to be pure if / is an extreme point of Ea- 

(Def. 19) A measurement on an abstract state space A is a set A4 = {/i, . . . , /„} of effects which sum up to the unit 
effect, X]"=i fi — '^A- (We give this definition for the sake of completeness. We will not use this notion 
below, but we will formulate all statements in terms of effects.) 

(Def. 20) For a pure effect / e Ea, the effect / := ua — / is the complementary effect to f . 

(Def. 21) For a pure effect /, the certain face Ff of / is defined by 

Ff:^{LuenA\f{Lo) = l}. 

The impossible face F f of / is defined by 

Ff ■.^{uenA\f{io)^o}. 

Obviously, Ff — Fj. (Fact 10) and (Fact 11) below imply that Ff and Ff are faces indeed, so the namings 
are justified. 

(Def. 22) A transformation T : A ^ A on an abstract state space A is a map which fulfills the following conditions: 

T is linear, 

T is positive, i.e. T{A+) C A+, (Al) 
ua{T{uj)) < 1 for aU w € Ha- (A2) 

Given that T is linear, conditions (Al) and (A2) can be summarized as 

Ti^A) C nf . (A3) 



(Def. 23) An abstract state space ^ is a polytopic theory if ^Ia is a polytope. (This is what we called a discrete theory, 
but in this technical appendix, we use the term "polytopic theory" because a polytope is a well-established 
mathematical term in the context of convex set.) 

(Def. 24) An abstract state space A is a classical theory if Ha is a simplex. 
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2. Known facts 

Recall the definition of a face and of an extreme point of a convex set, (Def. 2) and (Def. 3). 

(Fact 1) For a convex set C, a face 5* of a face of C is itself a face of C. Moreover, if 5, F C C are faces of C with 
S C F, then 5* is a face of F as well. Thus, for a face F of a convex set C, e E F is an extreme point of F 
(Def. 3) if and only if it is an extreme point of C. 

(Fact 2) For a face F of a convex set C, if a convex combination X]r=i li^^ ^ some ui, . . . , w„ e C and 
nonzero coefficients a^, then vi, . . . ,Vn £ F. [8, Prop. 2.7] 

(Fact 3) If F is a face of a convex set C, then F = aff (F) n C. [8, Prop. 2.10] 

In the following, V denotes a finite-dimensional real vector space and C y is any subset of V. 

(Fact 4) span(^) aff(5U {0}). 

(Fact 5) If ^ aff(5), then dim(span(S')) = dim^ + 1. 
(Fact 6) aff(conv(5)) = aff(S'). 

Let T : A ^ B he a linear map between vector spaces A and B, let 5*1, 5*2 C A be any subsets of A. 

(Fact 7) r(aff(S'i U S2)) = aff (T(^i) U T{S2)). 

(Fact 8) r(conv(S'i U S2)) = conv(T(5i) U T{S2)). 

(Fact 9) If T is bijective. Si C S2 and T{Si) D r(52), then S*! = fi-a. 

We will make use of the following properties of abstract state spaces. Recall (Def. 16), (Def. 17) and (Def. 18). 

(Fact 10) Let A be an abstract state space. If / € Fa is a pure effect, then the complementary effect / := ma — / is a 
pure effect as well. [8, Prop. 3.33] 

(Fact 11) For every nonzero pure effect / G Ea, the certain face Ff of / (as defined in (Def. 21)) is indeed a nonempty 
face of ri^. [8, Corollary 3.37] Thus, by (Fact 10), the impossible face F f = Fj is a face of Ha as well (which 
is nonempty if / 7^ u^). 

(Fact 12) For any subset S C Qa it holds that ^ aff(5). (This follows from aff(5) C aff(riA) = {w G A | ua(w) = 
1};^0.) 

The following facts about polytopcs will be useful. 

(Fact 13) Every polytope P has a minus-face. (More than that, there are lower bounds on the number of minus-faces 
of polytopes; see [29, Chapter 3.1].) 

(Fact 14) A face of a polytope is a polytope. (This is easily proved using (Fact 2).) 

3. Technical lemmas 

In this section, we prove four technical lemmas. Before we prove them, we give a rough overview over their role in 
the main proof. The main result. Theorem 5, is proved in two steps: 

(i) First, we show that every polytopic theory A which satisfies our postulate has a set of normalized states 57a 
which is uniformly pyramidal (we will see in Section A 4 what this means). 

(ii) In the second step, we show that every uniformly pyramidal polytope is a simplex, so ^ is a classical theory. 

The main technical lemma that allows us to prove these two steps is Lemma 2. It establishes geometric criteria that a 
set of states has to satisfy to be compatible with our postulate by specifying conditions on the certain face (Def. 21) 
of pure effects. This lemma has two parts (a) and (b), both of which we will use in step (i) of the proof of Theorem 
5. In order to prove Lemma 2, we need Lemma 1 which we prove first (this lemma will also be useful in the proof of 
step (ii)). To be applied properly. Lemma 2 needs a helper which comes in the form of Lemma 4. It shows that in 
the case where J7a is a polytope, every minus-face of J^a is the certain face of a pure effect. Lemma 3 in turn helps 
us to prove Lemma 4. Figure 11 gives an overview over the organisation of the proofs. 
Now we prove the four lemmas. 
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FIG. 11. This diagram shows how the proof of Theorem 5 is subdivided into several Lemmas. 



Lemma 1: Let C be a convex subset of a vector space V, let a <S be affincly independent of C (i.e. a ^ aff(C)). 
If F is a face of C, then D = conv(F U {a}) is a face of conv(C U {a}). 




FIG. 12. This figure visualizes the statement of Lemma 1. 



Proof. Let x,y e conv(C U {a}). Then, 

771—1 m 

aiPi + amO, for some a; > with ai = 1 



777—1 

X = 

i=l i=l 

and Pi e C Vi e {1, . . . , m — 1} . 



We can simplify the expression for x by defining 



777— 1 777—1 



Er77— 1 
7=1 "iPi ^ ^ _ 



E777 — 
7=1 



a,; 



7 = 1 



to get 

X = ap + aa with Q;,a>0, Q! + S=1, p € C . (A4) 

In a similar way, we get 

y = l3q + ]3a for some /3,^>0, /3+^=l, gGC. (A5) 
Let < A < 1, A = 1 - A. Then, 

Xx + Xy = Xap + Xl3q+ {Xa + X^)a. (A6) 

Suppose that 

Aa; + Ay = z for some z G conv(F U {a}) . (A7) 

According to (Def. 2), the claim is proved if we manage to show that (A7) implies that x,y E conv(_F U {a}). The 
point z being an element of conv(F U {a}) means that 

z — fir + JIa for some n,]! > , fi + J1 — 1 , r F . (A8) 
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The case /i = is easy: In this case, Xx + Xy = a, but a is an extreme point of conv(C U {a}) and thus x = y = a E 
cohy{F U {a}). Therefore, we consider the more difficult case and assume that 

^^^0. (A9) 

Equations (A6), (A7) and (A8) give 

+'pa = Xap + J/3q + (Xa + X'^)a . (AlO) 

Now assume that 

JI- Xa-X^ =^0. (All) 
This assumption allows us to rewrite (AlO) in the form 

^ -.{Xap + X(3q ^ fir) . (A12) 



/.J — Xa — A /? 

It is easily checked that the right hand side of (A12) is an affine combination of p, q and r: 

Xa + Xf3 - fi _ Aa + (1 - A)/? - ^ 



71- AS -A/3 (l-/i)-A(l-a)-(l-A)(l-/3) 



= 1 



Thus, assumption (All) implies that a E a.S{{p,q,r}) C aff(C). This contradicts the premise that a ^ afF(C), so 
assumption (All) must be wrong and therefore 

■Jl=Xa + X^ (A13) 

Equation (A13) simplifies Equation (AlO) to 

fir = Xap + XPq (A14) 
Writing out 'JI=1 — ii,a = l — a and /3 = 1 — /3, it is easily checked that Equation (A13) implies 

— + — = 1 , where M ^ by (A9) . (A15) 
fj. fi 

We can rewrite (A14) as 

r = —p + —q with p,q <eC, r e F . (A16) 

Equations (A15) and (A16), together with the fact that F is a face of C, implies that p,q E F (c.f. (Dcf. 2)). Thus, 
by Equations (A4) and (A5), we have that x,y € conv(F U {a}), which completes the proof. □ 

Lemma 2: Let {A, A+,ua) be an abstract state space, let / £ Ea be a pure effect. If there exists a transformation 
T : A such that uaoT = f and T{uj) = uj for every cu € Ff (Dei. 21), then 

(a) dim Ff + dim F f < dim ^Ia — 1 and 

(b) aff (F/ U Ff) n = conv(F/ U Fj). 

Proof. Let (^4, A+,ua) be an abstract state space, let / S Ea be a pure effect and T : A ^ A he sl transformation 
with ua ° T — f and T{uj) = uj for every uj G Ff. For the rest of the proof, it is useful to write out (Def. 22) of a 
transformation and to list all the properties of T: 

T : A ^ A map such that 

T is finear, (A17) 

T is positive, i.e. T{A+) C A+ , (A18) 

UAoT = f (A19) 

ua{T{uj)) < ua{uj) Vuj eilA, (A20) 

r(w) = w Vw e F/ . (A21) 




FIG. 13. A few examples illustrating Lemma 2: The square violates condition (a). As we have demonstrated in the main text, 
this leads to a dimension mismatch for the transformation. The pentagon satisfies condition (a) but violates condition (b). In 
the main text, we have seen that this results in a non-positive transformation: The white parts of the pentagon (i.e. the parts 
outside of the gray area conv(_Fy U -F/)) are map outside of il^^ . The triangle satisfies both (a) and (b). 



Our goal is to show that properties (A17 - A21) imply (a) and (b) as stated above. 
First note that (A21) and the linearity of T (A17) imply 

T\spa.n{Ff) — -^lspan(Fj) , (A22) 

where /|span(F/) is the restriction of the identity operator to span(F^). On the other hand, by the definition of F f, it 
holds that /(w) = for all w G Ff, so by (A19), we have that UAiT{uj)) ~ for all uj G Fj. This implies T{u}) is the 
zero-vector for all lu G F f, so by the linearity of T (A17), this means that 

'^lspan("Fj) = 0|gp3^jj(p^) , (A23) 

where 0|gp^j^j;p^j denotes the restriction of the zero-operator to span(i^/). 

(a) For Equations (A22) and (A23) to be satisfied simultaneously, we must have that 

span(F/) n span(F/) = {0} , 

which is only possible if 

dim(span(F/)) + dim(span(F/)) < dim A (A24) 
Since ^ afr(F/) and ^ afr(F/) (Fact 12), (Fact 5) implies that 

dim(span(i^/)) = dim(F/) -f- 1 , 
dim(span(F/)) = dim(F/) + 1 . 

Noting that dim A = dimfi^ + 1, this allows us to rewrite Inequality (A24): 

dim Ff + dim F/ + 2 < dim flA + 1 

and therefore 

dim Ff + dim F f < dim Ha — 1 • 

(b) Equations (A22) and (A23) imply that 

TiFf) = Ff , 
TiFf) = {0} . 
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Thus, by (Fact 8), it holds that 

T(conv(F; U Ff)) - conv(T(F/) U T(Ff)) 
= conv(F/ U {0}) . 

By Lemma 1, the set conv(F/ U {0}) is a face of conv(51yi U {0}) = ^a^- This ahows us to apply (Fact 3) to see 
that 

conv(^> U {0}) = aff (conv(i^/ U {0})) n 

aff(FjU{0}) by (Fact 7) 

= afr(^^/ u {0}) n Qf 

and thus 

r(conv(F/ U Ff)) = aS{Ff U {0}) n . (A25) 
In the following, we show that this is equal to T(aff(F/ U F f) D Ha)- First note that 

T{aS{Ff U Ff) n Qa) = T(aff (F/ U Ff)) n T^Qa) ■ 
We can rewrite this term by means of (Fact 7), 

T{siS{Ff U Ff) ^ 8iS(T{Ff) U T(Ff)) , 
and by means of (A18) and (A20) (c.f. (A3)), 

TiflA) C nf , (A26) 

to get 

T{siS{Ff U Ff) n nA) = aff (T(F/) U T(Ff)) n T(^]a) ■ 

Ff {0} cosi 

Thus, 

T(afr(i^/ u Ff) n Qa) c afr(i^/ u {0}) n n^^ . (A27) 

Note that 7^laff(Fj:UF/) ^ bijection (this is easily verified from Equations (A22) and (A23)). Moreover, 

conv{FfUFf)<Za.S{FfLiFf)nnA. (A28) 
By virtue of (Fact 9), Equations (A25), (A27) and (A28) imply 

aS{Ff U Ff) n = conv(F/ U F f) , 
which is what we wanted to show. □ 
Lemma 3: Let A be an abstract state space, let 5* C ^Ia be any subset of the normalized states. Then 

Us {/ e -Ba I /(w) = 1 e 5} 

is a face of Ea- 

Proof. We have to check the properties listed in (Def. 2). Obviously, ua € Us, so Us is nonempty. If /i,/2 € Ea 
with /i(w) = /2(w) = 1 for all uj £ S, then \fi{uj) + (1 — A)/2(w) = 1 for aU A G [0, 1] and all uj £ S, so Us is convex. 
Let / S Us, let g,h & Ea and < A < 1 such that Xg + {1 — X)h = f. For any w e 5", we have that 

A g{uj) + (1 - A) h{uj) = 1 . (A29) 
<i <i <i <i 

Equation (A29) can only be satisfied if 5(0;) = h{uj) = 1, so g, /i G Us- □ 
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FIG. 14. This figure shows three examples where the set Us (as defined in Lemma 3) is visuahzed. As one can see, in all three 
cases, the set Us is a face of Ea- For simplicity of the picture, we have chosen the subset S C Qa to be a face of Qa- However, 
the fact that Us is a face of Ea also holds when S is not a face. In the square example, for instance, if we would take any 
subset of the edge S with at least two elements, then the set Us would still be the same. 



Lemma 4: Let A be an abstract state space, let F be a minus- face of Ha (Dei. 9). Then there is a unique pure 
effect / s Ea such that F is the certain face of / (Def. 21), i.e. Ff = F. 

Proof. First, note that any effect g with Fg = F must be an element of Up ^ {g G Ea \ g{io) = 1 Vw G F} since 
g e Up is equivalent to F C_ Fg. Let g G Up. By the linearity of the condition 

g{ij) = 1 Vw e F (A30) 

determines g on span(F). We know from (Fact 12) and (Fact 5) that dini(span(F)) = dimF + 1. Moreover, 
the premise that F is a minus- face of gives dim F + 1 = dim ^a — 1 + 1 = dim ^a = dim A — 1 and thus 
dim(span(F)) = dim A — 1. Thus, any functional g A* satisfying condition (A30) is fully determined by specifying 
its value at some point p ^ span(F). Let a, 13 € R, a ^ /3, and let ga,gp G A* be the unique functional satisfying 
(A30) and ga{p) = ct^gpiv) = respectively. Any g & A* satisfying (A30) lies in the affinc hull of g^ and gp since 
for g € A* with g{p) = 7, it holds that 

gyp) = 7 = — -.a +1 ^ p = — -s9a{p) +1 n gfsip) ■ 

a ^ p \ a — p J a — p \ a ~ p J 

Thus, 

UF^{geEA\ g{oj) = 1 Vt^ e F} C aff({g„} U {gp]) 
dim Up = 1. 

According to Lemma 3, Up is a face of Ea. Therefore, Up is a convex 1-dimensional set, which is nothing but a 
line-segment. A line segment has exactly two extreme points (namely its endpoints). By (Fact 1), these two extreme 
points are precisely the extreme points of Ea compatible with (A30) and therefore pure effects. The pure effect 
UA € Ea is obviously one of these two pure effects (for which Fu^ = ^a, see Fig. 15). Let the other one be denoted 
by /. Obviously, F C Ff since / £ Up. By (Fact 11), Ff is a face of i^A. However, by the premise that F is a 
minus-face of Ha and by (Fact 3), the only faces of Ha containing F are F and Ha. The latter can be excluded since 
f ^ UA. Thus, / is the unique pure effect such that Ff = F. □ 



4. The main result 

The following definition will be useful in the proof of Theorem 5. 

Definition: Wc call a polytopc P uniformly pyramidal (For a motivation of this naming, see Example 2.40 and 
Definition 5.3 in [8].) if for every minus-face F of P, it holds that P = conv(F U {ap}) for some ap G P. Note that 
in this case, it obviously holds that ap ^ aff(F). 

From a physical point of view, it would be sufficient to derive the result from the assumption that our postulate 
holds for every pure effect. However, in Theorem 5, we assume even less: Wc only assume the postulate for pure 
effects for which the certain face Ff is a minus- face of Ha. This is a weaker assumption and thus, wc prove a stronger 
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The only elements of Uf that The certain face of ua The certain face of / 

are pure effects are ua and /. does not coincide with F. does coincide with F. 

FIG. 15. This figure illustrates the proof idea of Lemma 4 by considering the case where f is a minus-face of a square-shaped 
set of states. 




P = conv(GU {ac}), 
but there is no 
an G P such that 
P = conv{H U {an}) 



an 

tetrahedron T pyramid P 

uniformly pyramidal not uniformly pyramidal 



FIG. 16. The tetrahedron T on the left is an example of a uniformly pyramidal polytope. The shape P on the right (formed 
like an Egyptian pyramid) is not uniformly pyramidal: it is only pyramidal with respect to its ground face G. For every other 
face F, there are two extreme points of the pyramid that are not contained in the face F, so the pyramid is not of the form 
conv(FU {ap}). 

statement. This will be useful in Appendix B. To prove Theorem 10, we will make use of Lemma 6, which is the 
contraposition of Theorem 5. This contraposition has the right form if we only assume the postulate for pure effects 
for which the certain face Ff is a minus- face of D,a- 

Theorem 5: Let A be a polytopic theory (Def. 23) satisfying the following weak form of our postulate: For every 
pure effect / S Ea for which the certain face Ff is a minus- face of ft a, there is a transformation T : A ^ A such that 
f ^uaoT and T{uj) = lo for every uj G Ff (Def. 21). Then A is a classical theory (Def. 24). 

Proof. We prove this theorem in two steps. 

(i) In the first step, we show that the assumptions imply that the polytope ^Ia is uniformly pyramidal (see the 
definition above). 

(ii) Then we show that a uniformly pyramidal polytope VLa must be a simplex, so ^ is a classical theory. 

Now we prove each of the two steps. 

(i) Let _F be a minus- face of VLa (which exists by (Fact 13)). We have proved in Lemma 4 that there is a unique pure 
effect / S Ea such that F is the certain face of /, i.e. Ff = F (Def. 21). Let T : A ^ Ahc & transformation 
such that / = UA ° T and T{ijj) = uj for every ui £ Ff. By Lemma 2 (a), we have that 

dim Ff + dim Ff= dim flA — 1 + dim F f 
< dim flA — 1 
dimF/ < 0. 
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aff(F;) 




FIG. 17. Here we see the geometrical entities that are discussed in step (i) ol the proof of Theorem 5 for the case where 
Qa is two-dimensional. The set _F is a minus-face of the states Qa- It is the certain face of a pure effect / € Ea, i.e. 
Ff = f. The impossible face Ff of / consists of a single point ap- It holds that aff(F/) n Ff — 0, from which it follows that 
aS{Ff (J Ff) n Qa ~ since Ff is a minus- face of Ha- 

Thus, Ff must be a singleton or the empty set (Def. 8). The latter is excluded since the pure effect / is different 
from UA (because = Ha F). Therefore, / (Def. 20) is nonzero and pure (Fact 10) and thus Ff ~ Fj is 
nonempty (Fact 11). Thus, 'Ff = {ap} for some ap £ ^a- Note that aff(F/) n F/ = since afF(i^/) CiQa^ Ff 
and FfnFf = 0. Thus, aff(F/ U F/) n = since dim Ff diml^A - 1 and F/ C \ afr(F/). Thus, by 
Lemma 2 (b), we have that 



The point ap is afHnely independent of F since aff(F/) n = 0, as we have already noticed. 

(ii) Now we show that every uniformly pyramidal polytope flA is a simplex. We prove this by induction over dimfi^i. 
The base case dim 57^ = is trivial: A singleton is a simplex. The case dim D,a = 1 is equally easy: Every 
one-dimensional polytope is a line-segment, and a line-segment is a simplex. 

Let Qa be a uniformly pyramidal polytope with dimfi^ ~ d > 2. Assume that every (d — l)-dimcnsional 
uniformly pyramidal polytope is a simplex (induction hypothesis). Let F C Qa be a minus- face of Qa- Since 
Qa is uniformly pyramidal, we have that Qa = conv(F U {ap}) for some ap G Qa with ap ^ aff(F). We want 
to show that Qa is a simplex. To this end, it is sufficient to show that F is a simplex since F being a simplex 
and Qa = conv(F U {ap}) with ap ^ aff(F) implies that Qa is a simplex (c.f. (Def. 12)). We show that F is 
uniformly pyramidal (by the induction hypothesis, this implies that F is a simplex). 



FIG. 18. This figure visualizes the definitions in the proof that every uniformly pyramidal polytope Qa is a simplex. 

The set F is a face of a polytope and therefore a polytope itself (Fact 14). Recalling (Def. 11), we see that the 
equation Qa = conv(F U {ap}) implies that 



Qa 



aS{FfUFf)nQA 
conv(FU{aF}). 




ne{QA) 



ne(F) + l. 



(A31) 



Let G be a minus-face of F (see Fig. 18). By Lemma 1, H := conv(GU {ap}) is a face of conv(FU {ap}) ~ Qa- 
The dimension of H is given by dim 7? = dim(conv(G U {ap})) = dimG + 1 = dimF = dimi7yi — 1, so iJ is 
a minus-face of Qa- Thus, since Qa is uniformly pyramidal, Qa — conv{H U {an}) for some an € Qa- This 
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allows us to see that 

ne{G) = ?ie(conv(G) U {ap}) - 1 
= UeiH) - 1 

= ne{conv{H U {an})) - 2 
= ne(riA) - 2 , 

so by (A31), 

ne{G) = ne{F) - 1 . (A32) 

Equation (A32) and the fact that G is a minus- face of F imply that 

F = conv(G U {be}) for some ba e F . (ASS) 

Since G is an arbitrary minus-face of F, (A33) implies that F is uniformly pyramidal. By the induction hy- 
pothesis, it follows that is a simplex, so flA ~ conv(i^U {ap}) is a simplex since ap is affinely independent of 
F. □ 
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Appendix B: Formal proof of the approximate case 

111 the following, we will give a formal prove of the approximate version of the result. "Approximate" means that 
this version shows that discrete (polytopic) non-classical theories are ruled out even if the postulate is weakened to 
an approximate version. Therefore, the result presented here is stronger than the original version of the result. The 
statement of the approximate version is more difficult to read than the original version. It reads: 

Let A be a polytopic non-classical theory (Dcf. 23) and (Def. 24) and let || • ||^ be any norm on A. Then 
there is a pure effect / e Ea and a positive number e with the following property; For every transformation 
T : A ^ A with f = ua°T, there is a state p G flA with f{p) = 1 and \\T{p) — p\\a > £• 

This statement has the form of a contrapositive of the original version. To see why this is a stronger version of the 
main result, the reader is invited to convince himself that if e is chosen to be just zero (instead of being positive) and 
the distance ||T(p) — p\\a > e is replaced by \\T{p) — p\\a > 0, then this statement boils down to the original version 
of the result. The statement above (with positive e) means that a polytopic non-classical theory cannot even satisfy 
our postulate "up to e" . 

As before, we organise the proof in a concise way by first listing all the definitions that we need in Section B 1 and 
by listing all the facts that we will use in the proof in Section B 2. This time, the lists are shorter since we do not 
repeat definitions and facts of Appendix A. In Section B 3, we give an overview over the proof idea and prove all the 
lemmas that we will need. Finally, we give the proof of the approximate version in Section B4. 

1. Notation and Definitions 

(Def. 25) For a normed space A and a closed subset C C ^, we define 

d( • , C) : A M 

X I— >■ d{x,C) := inf ||y— a;|| 

This map has the property that for all x E A, it holds that d{x, C) > with equality if and only if x e C. 

(Def. 26) For a vector space A, let End(A) denote the space of endomorphisms on A, i.e. the vector space of all linear 
maps from A to itself. 

(Def. 27) For an abstract state space {A, A+,ua) and an effect / G Ea, we define the set of all transformations that 
induce the effect / as 

Tf := {T e End(A) | T is positive, o T = /} . 

Note that since T G End(yl) implies linearity of T and ua°T = f implies Myi(r(a;)) < 1 for all oj e il^, the 
elements of 7/ are precisely the transformations (Def. 22) T : A ^ A with ua o T ~ f . 

If II • \\a is a norm on A, then the operator norm || • ||End(yi) induces a metric dq-g{S,T) := ||5 — T||End(A) on 
7/, which turns 7/ into a metric space. 

(Def. 28) For an abstract state space A, an effect / e Ea and a norm || • on A, we define the disturbance function 

Df.Tf^M. 

T I— > max ||r(a;) — wjl^ 

LO^Ff 

For every transformation T : A ^ A which induces the effect / (i.e. uaoT = /), the disturbance function 
evaluates the maximal disturbance on the certain face Ff of / (Def. 21) caused by the transformation. 

2. Known facts 

(Fact 15) For any norm-induced topology on a finite-dimensional vector space A, a polytope P C A is compact. 

(Fact 16) On a finite-dimensional vector space A, any two norms || ■ ||a and || ■ ||^ are equivalent, i.e. there arc positive 
constants ci,C2 such that ci||?;m < \\v\\'a ^ C2||fm for all v G A. 
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(Fact 17) (Heinc-Borel Theorem) In a finite-dimensional normed space A, a subset 5 C A is compact if and only if S 
is closed and bounded. 

(Fact 18) In a vector space A, the closure S* of a subset S C A coincides with the set of all limits of sequences in S 
that converge in A. Thus, a subset C A is closed if (and only if) every sequence in S that converges in A 
has its limit in S. 

(Fact 19) If / : X — > y is a linear map between between finite-dimensional normed spaces, then for every convergent 
sequence x„ — > .t g X, it holds that /(a;„) — >■ f{x) G Y. 



3. Technical lemmas 



In this section, we will prove four technical lemmas that will allow us to prove Theorem 10. To see where things are 
going, we first present a sketch of the organisation of the proof. As a corollary of Theorem 5 that we have proved in 
Appendix A, we will first show in Lemma 6 that for every non-classical polytopic theory, there is a pure effect / such 
that its certain face Ff (Dei. 21) is a minus- face of the states and such that there is no transformation that induces 
/ which satisfies our postulate precisely. We start the proof of Theorem 10 by considering a non-classical polytopic 
theory. We apply Lemma 6 which allows us to consider a pure effect / with the mentioned properties. 

For the actual proof of the theorem, we then make a distinction of cases (see Fig. 19). We first prove case (i) where 
we assume that the dimension of the impossible face Ff of / is zero-dimensional (in other words, consists of a single 
point). It is practical to consider this case separately since in this case, we can make a proof that considers a linear 
map L (with certain properties) which does not exists ii F f is higher-dimensional. 

The other case (ii) is the case where dim Ff > 1. Lemma 7 will help us to show that in this case, any transformation 
T that induces / (i.e. T G Tf, {Dei. 27)) must map Ff to a set T{Ff) of lower dimension than Ff. This implies that 
for every T G Tf, there is a state oj such that T{uj) ^ w and therefore ||T(w) — w||a>0. This wiU show that the 
disturbance function is a positive function on 7/. At this point, we will be left to show that Df is lower-bounded 
by a positive number e. We will show this by showing that 7/ is compact (Lemma 8) and that Df is a continuous 
map on Tf (Lemma 9). 




FIG. 19. This diagram shows how the proof of Theorem 10 is subdivided into several Lemmas. 



Lemma 6: Let A be a polytopic non-classical theory. Then there is a pure effect / G such that the certain face 
Ff is a minus-face of VIa and such that there is no transformation L : A ^ A with / = ua o L and L(ijj) = ui for all 

^ e Ff. 

Proof. Let {Fi}i^i be the set of minus-faces of D,a. By Lemma 4, for every i € I, there is a unique pure effect fi € Ea 
such that Ff. = Fi. Since A is polytopic but non-classical, we can apply the contraposition of Theorem 5 to see that 
there must he a k G I such that there is no transformation L : A ^ A with fk = ua ° L and L(uj) = w for all u £ Ff^ . 
Thus, / := fk is the effect we were looking for. □ 

Lemma 7: Let A he a non-trivial abstract state space (i.e. dim A > 1). Let / e Ea be a pure effect. Let T : A ^ A 
be a transformation such that / = ua ° T. Then, for the certain face Ff of / (Def. 21), we have 

dim(T(_F/)) < dim^ - dim Ff - 2 , 

where Ff is the impossible face of / (Def. 21) (Recall the convention dim(0) ~ —1 (Def. 8)). 
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Proof. The impossible face Ff is the subset of Ha where / vanishes, F f = {u! G ^Ia \ /('^) = 0}. Thus, the assumption 
that f = UA o T means that /(w) = for all lo G Ff implies T{Ff) = {0} (since the zero-vector is the only element 
of ri^^ with normalization equal to zero). By linearity of T, this implies r(span(F/)) = {0}, so ker(r) D span(F/) 
and therefore dim(ker(T)) > dim(span(F/)). (Fact 12) and (Fact 5) imply that dim(span(F/)) = dim(Fy) + 1 and 
thus dim(ker(r)) > dim(F/) + 1. Therefore, 

dim(im(T)) < dim A - dim(F/) - 1 . (Bl) 

On the other hand, the condition that f{uj) — ua{T{uj)) — 1 for all uj G Ff implies that T{Ff) C (rather than 
just T{Ff) C n^)- Therefore, 

T{Ff) C im(T) n C ini(T) n uSi^A) ■ (B2) 
In an abstract state space, we have that 

dim{a.S{flA)) = dim A - 1 . (B3) 

Moreover, 

e im(T) , but ^ aff (r^A) by (Fact 12) . (B4) 
We can combine (Bl), (B3) and (B4) to see that 

dim(im(r) n aS{nA)) < dim A - dim(F/) - 2 . 

Thus, by (B2), 

dim(r(F/)) < dim A - dim(F/) - 2 

as claimed. □ 

Lemma 8: Let A be an abstract state space, let / S Ea be an effect. Then, for any norm \\ ■ \\a on A, the space 
(7}',dr/) (Dcf. 27) is compact. 

Proof. Since End(A) is a finite-dimensional vector space, it is sufficient to show that 7/ is closed and bounded (Fact 
17). 

• Closedness: Let (T„)„ be a sequence in 7/ that converges in End(A), i.e. r„ — )• T e End(A). Closedness of 7/ 
can be shown by showing that the limit T is an element of 7/ (Fact 18), in other words by showing that T is 
positive and that ua oT = f. 

— Positivity: Let uj G A+. The map T„ i— > r„(a;) is a linear map from End(A) to A. Thus, since (Tn)n Q Tf 
is convergent, the sequence (r„(a;))„ C A^ is convergent as well, and the limit of (r„(aj))„ coincides with 
T{uj) (Fact 19). By the definition of an abstract state space (Def. 16), A+ is closed, so the limit T{uj) of 
(r„(a;))„ is an element of A^ (Fact 18), so T is positive. 

— UA o T ^ f: Note that the sequence (ua o Tn)n in A* is constantly equal to / and thus ua oTn ^ f ■ On 
the other hand, the map T„ i— ^ ua o T„ is a linear map from End(^) to A* , so ua o T^n — ^ ua ° T (Fact 19) 
and thus ua ° T = f . 

We have shown that T is positive and that ua o T ^ f . Therefore, Tn ^ T E Tf , so Tf is closed. 

• Boundedness: Since any two norms on End(A) are equivalent (Fact 16), it is sufficient to show the boundedness 
of Tf for a particular choice of a norm on End(A). Choose the norm HTjlu^ — sup^^g^j^ \ua{T{u!))\. It is easily 
verified that this indeed gives a norm on End(A) (for positive definiteness, make use of the fact that span(r2A) = 
A since A+ is generating). For all T S 7/, it holds that ua°T = f and thus |lr|l„^ = sup^gj^^ l/('^)l < 1: so 
Tf is bounded. 

We have shown that Tf is closed and bounded, so by (Fact 17), 7/ is compact. □ 
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Lemma 9: For an abstract state space A and an effect / € Ea, it holds that for any norm j] • \\a on A, the disturbance 
fmiction D/ (Def. 28) 



Df. Tf ^ R 

T n- max ||T(ti;) — cj|| A 
uieFf 

is a continuous function on Tf with respect to the operator norm || • ||End(A)i c.f. (Def. 27). 

Proof. This is easily calculated. Let I a be the identity operator on A. For any T, S* G 7/, we have that 



\Df{T)^Df{S)\ = 



max ||r(w) — uj\\a 

max II (T - Ia)^\\a 

LjeFf 



( max \\S{a) - a\\ 



i max 11(5 - /A)cr|U 

\creFf 



< 



max ||(r - /a)w|U - IK'S' - Ia)^^\\a 

uj^Ff 



\{T-Ia)lo\\a-\\{S-IaMa 



\\{T ^ lA)io - {S - IaPWa 



< max 

LuGFf 

< max 
u>eFf 



< max \\{T-S)uj\\a 

uiGFf 



< 



i max ||a;||A ) ||T - S'||End(A) , 



so Df is continuous. 



□ 



4. Proof of the theorem 

Theorem 10: Let A be a polytopic non-classical theory (Def. 23) and (Def. 24) and let || • \\a be any norm on A. 
Then there is a pure effect / S Ea and a positive number e with the following property: For every transformation 
T : A ^ A with f ^ ua o T, there is a state p £ Ha with f{p) = 1 and \\T{p) — p\\a > £■ 

Proof. By virtue of Lemma 6, there is a pure effect / e Ea such that the certain face Ff is a minus-face of Ha and 
such that 



there is no transformation L : A ^ A 
with / ~ UA o L and i(a;) = cj Vo; £ _Fy . 



(B5) 



This is the effect / for which we will show the existence of a number e > with the claimed properties. We make a 
proof by cases, where we distinguish between the cases where the impossible face F f of / (Def. 21) satisfies dimity = 
and where dim Ff > 1 (the case dim Ff = — 1 is not possible since Ff is a minus- face of Ha). 



(i) Assume that dim Ff =0, i.e. 



{w/} for some ujf €z Ha ■ 



Since w/ ^ span(J/), it holds that span(F/) n span(F/) — {0}. (a;/ ^ span(i^/) can be verified using the fact 
that span(F/) = aS{Ff U {0}) (Fact 4), Ff = aff (F/) n Ha (Fact 3) and ua(w/) ^ 1 but ua{0) = 0.) Thus, there 
is a linear map L : A ^ A with 



L{ll!) ~ Lo Voj e Ff . 
L(Ff) = {0} . 



(B6) 
(B7) 



28 



Note that 



(since Ff C is a minus- face) . Thus, 



dim Ff = dim A - 2 



(B8) 



dim(span(F/)) + dim(span(F/)) = diuiFf + 1 + dimF/ + 1 

= (dimA-2) + l + + l 
— dim A , 



by (Fact 12) and (Fact 5) 
by (B8) 



so the conditions (B6) and (B7) fully determine the linear map L : A ^ A. It also means that (B6) and (B7) 
imply that / = ua o L on a set that spans A, and thus / = ua ° L everywhere. Thus, by (B5), L cannot be a 
transformation, so by (Def. 22), linearity or L{^a) C fi^^ must fail for L. However, we have constructed L to 
be linear, so L{VLa) ^ must fail. Thus, there is a r € ^a such that 



l(t) i n 



<i 

A ' 



i.e. d{L{T),n2^)>0 



(B9) 



where d{ ■ , ^a^) is defined in (Def. 25). Define 



dA := dim A 



and recall our definition of the dimension of a set, (Def. 8). It holds that r G ^a ^ aff(F/ U Ff). (This follows 
from the fact that Ff is a minus- face of ft a and Ff,Ff C flA but Ff ^ aS{Ff).) This implies that there must 
be diuiFf + 1 = (dim A — 2) + 1 = (d^ — 1) points {cji, . . . , C Ff such that 



T e aff({aji, , 

/dA-l 



■ ,ujdA-i,^f}) ^ i-e- 



dA 



a^A^ f for some coefficients {ajjfi^x with — 1 



(BIO) 



We define 



l,...,dA~l}. 



Note that Qmax is positive by (BIO) since t ^ ujf (we have chosen r such that L{t) ^ , but Liuif) = G fi^ 
Let T : A ^ Ahe any transformation with / = ua ° T . Assume that 



\\rr( \ II / ) w ^ p 

[a A - l)amax 



(Bll) 



We will show that this leads to a contradiction to the assumption that T is positive. This, in turn, will show 
that the term on the right hand side of Inequality (Bll) is the e with the claimed property (note that the term 
is independent on the choice of the transformation T). 

Together with Assumption (Bll), we can use the triangle-inequality for the norm to derive the following bound: 



|lr(r)-L(r)|U = 



aiT{uj^) \ +adAT{i:Jf)~ ^ ajL{uj) j - adAHuJf) 



by (BIG) 



< 



dA~l 

El 



|T(c^O - L{u,) \\a + \adA\ II TioJf) - L{u]f) \l 



dA-l 

1=1 



< {dA - l)an 



d{L{T),Qf) 
{dA - l)amax 



by (Bll) 
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and therefore 



\\T{r)-L{r)\\A<d{L{r),nf). (B12) 

<i 



"A 

For any a € f^^^, we can use the triangle inequahty again to derive the foUowing inequahty: 



\\L{t) -a\\A< \\L{r) - T{t)\\a + \\T{t) - a|U 

\\T{t) a\U > \\L{t) a\U WLir) - T(r)|U • (B13) 



This ahows us to conclude 
■><i 



diT{T),nf)= min ||r(r)-a|U 
renY 



> min [\\L{T)-a\\A-\\L{T)-T{T)\\A] by (B13) 



and therefore 



= min ||L(r)-alU -|lL(r)-T(T)|U 

>d{L{T),nf)~d{L{T),nf) by(B12) 



diTiT),nf)>o. 



Thus, assumption (Bll) implies that there is a r G fi^i which is mapped outside of fl^^ by T, so it implies that 
the map T is not positive. But T is a transformation and therefore positive, so the assumption (Bll) must be 
wrong. The negation of (Bll) is 

^peFr.\\Tip)-p\\A>§^^. (B14) 
The fact that p E Ff means that f{p) = 1. Set 

^ _ d{L{T)Mf) ^ 
{dA - l)Q!max 

This is a positive number by (B9). Since e is independent of T and T is an arbitrary transformation with 
/ = UA o T, (B14) means that we have proved the claim for the case where AitclF f — 0. 

(ii) Assume that dimF/ > 1. Let T : Ahc & transformation such that f ~ uaoT. According to Lemma 7, we 
have 

dim(T(i^/)) <dim^-dimF/-2 

< ~ 3 , where dA '■= dim A . 

Note that since VIa is a non-classical polytope, it holds that dim A > 3 (for dim A = 1, 2, the set VIa is a point or 
a line, respectively, which both are a simplex and therefore classical). The inequality shows that the minus- face 
Ff of ^A^ which is a polytope with dimension dimF/ = — 2, is mapped to a set T{Ff) which is at most 
{d-A — 3)-dimensional. Diagrammatically, 

F^ ^ T{Ffl crtA. 

(d^— 2)-dim. < — 3)-dim. 

Therefore, Ff cannot be contained in T{Ff). In particular, there must be an w £ Ff such that ||r(a;) — wjl^ > 0. 
Since the map uj i— >■ ||T(a;) — cj||a is a continuous function and the polytope Ff is compact (Fact 15), the map 
attains a maximum on Ff. So far, we have shown the following: For every transformation T : A A with 
/ = UA ° T, it holds that max^g^T'^ l|T(w) — is positive. Recapitulate our previous definitions (Def. 27), 
(Def. 28). The transformations under consideration are given by 



Tf = {T e End(A) I T is positive, uaoT = f}, 
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This definition allows us to write the disturbance function Df as a function on 7/: 

Df : Tf ^ R 

T I— > max ||T(ti;) — 

Using these definitions, we can summarize what we have proved so far by stating that the disturbance function 
D/ is a positive function on 7/. From Lemmas 8 and 9, we know that 7/ is compact and that Df is continuous on 
Tf. A continuous function on a compact space attains a minimum, so min^gT-^ D f{T) exists. Since the minimum 
of a positive function must be positive, it holds that 



min Df 

TeTf 




Set 



e := min ( max \\T(uj) 
TeTf \ueFf 




Thus, e is a positive number such that for every T ^ Tf, there is a p G Ff such that 



\\T{p)-p\\A>e. 



Writing out the definitions of 7/ and Ff, (Def. 27) and (Dcf. 21), we have proved the existence of an e > with 
the property that for every transformation T : A ^ A with ua oT = f , there is a p € ^Ia with f{p) — 1 such 
that ||r(/3) — pIIa > e, so we have proved the claim. □ 



